skip to main content


Search for: All records

Creators/Authors contains: "Katsaggelos, Aggelos"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    The Gravity Spy project aims to uncover the origins of glitches, transient bursts of noise that hamper analysis of gravitational-wave data. By using both the work of citizen-science volunteers and machine learning algorithms, the Gravity Spy project enables reliable classification of glitches. Citizen science and machine learning are intrinsically coupled within the Gravity Spy framework, with machine learning classifications providing a rapid first-pass classification of the dataset and enabling tiered volunteer training, and volunteer-based classifications verifying the machine classifications, bolstering the machine learning training set and identifying new morphological classes of glitches. These classifications are now routinely used in studies characterizing the performance of the LIGO gravitational-wave detectors. Providing the volunteers with a training framework that teaches them to classify a wide range of glitches, as well as additional tools to aid their investigations of interesting glitches, empowers them to make discoveries of new classes of glitches. This demonstrates that, when giving suitable support, volunteers can go beyond simple classification tasks to identify new features in data at a level comparable to domain experts. The Gravity Spy project is now providing volunteers with more complicated data that includes auxiliary monitors of the detector to identify the root cause of glitches.

     
    more » « less
  2. Abstract The relationship of human brain structure to cognitive function is complex, and how this relationship differs between childhood and adulthood is poorly understood. One strong hypothesis suggests the cognitive function of Fluid Intelligence (Gf) is dependent on prefrontal cortex and parietal cortex. In this work, we developed a novel graph convolutional neural networks (gCNNs) for the analysis of localized anatomic shape and prediction of Gf. Morphologic information of the cortical ribbons and subcortical structures was extracted from T1-weighted MRIs within two independent cohorts, the Adolescent Brain Cognitive Development Study (ABCD; age: 9.93 ± 0.62 years) of children and the Human Connectome Project (HCP; age: 28.81 ± 3.70 years). Prediction combining cortical and subcortical surfaces together yielded the highest accuracy of Gf for both ABCD (R = 0.314) and HCP datasets (R = 0.454), outperforming the state-of-the-art prediction of Gf from any other brain measures in the literature. Across both datasets, the morphology of the amygdala, hippocampus, and nucleus accumbens, along with temporal, parietal and cingulate cortex consistently drove the prediction of Gf, suggesting a significant reframing of the relationship between brain morphology and Gf to include systems involved with reward/aversion processing, judgment and decision-making, motivation, and emotion. 
    more » « less
  3. X-ray fluorescence (XRF) spectroscopy is a common technique in the field of heritage science. However, data processing and data interpretation remain a challenge as they are time consuming and often require a priori knowledge of the composition of the materials present in the analyzed objects. For this reason, we developed an open-source, unsupervised dictionary learning algorithm reducing the complexity of large datasets containing 10s of thousands of spectra and identifying patterns. The algorithm runs in Julia, a programming language that allows for faster data processing compared to Python and R. This approach quickly reduces the number of variables and creates correlated elemental maps, characteristic for pigments containing various elements or for pigment mixtures. This alternative approach creates an overcomplete dictionary which is learned from the input data itself, therefore reducing the a priori user knowledge. The feasibility of this method was first confirmed by applying it to a mock-up board containing various known pigment mixtures. The algorithm was then applied to a macro XRF (MA-XRF) data set obtained on an 18th century Mexican painting, and positively identified smalt (pigment characterized by the co-occurrence of cobalt, arsenic, bismuth, nickel, and potassium), mixtures of vermilion and lead white, and two complex conservation materials/interventions. Moreover, the algorithm identified correlated elements that were not identified using the traditional elemental maps approach without image processing. This approach proved very useful as it yielded the same conclusions as the traditional elemental maps approach followed by elemental maps comparison but with a much faster data processing time. Furthermore, no image processing or user manipulation was required to understand elemental correlation. This open-source, open-access, and thus freely available code running in a platform allowing faster processing and larger data sets represents a useful resource to understand better the pigments and mixtures used in historical paintings and their possible various conservation campaigns. 
    more » « less
  4. X-ray fluorescence spectroscopy (XRF) plays an important role for elemental analysis in a wide range of scientific fields, especially in cultural heritage. XRF imaging, which uses a raster scan to acquire spectra pixel-wise across artworks, provides the opportunity for spatial analysis of pigment distributions based on their elemental composition. However, conventional XRF-based pigment identification relies on time-consuming elemental mapping facilitated by the interpretation of measured spectra by experts. To reduce the reliance on manual work, recent studies have applied machine learning techniques to cluster similar XRF spectra in data analysis and to identify the most likely pigments. Nevertheless, it is still challenging to implement automatic pigment identification strategies to directly tackle the complex structure of real paintings, e.g. pigment mixtures and layered pigments. In addition, pigment identification based on XRF on a pixel-by-pixel basis remains an obstacle due to the high noise level. Therefore, we developed a deep-learning based pigment identification framework to fully automate the process. In particular, this method offers high sensitivity to the underlying pigments and to the pigments present in low concentrations, therefore enabling robust mapping of pigments based on single-pixel XRF spectra. As case studies, we applied our framework to lab-prepared mock-up paintings and two 19th-century paintings: Paul Gauguin's Poèmes Barbares (1896) that contains layered pigments with an underlying painting, and Paul Cezanne's The Bathers (1899–1904). The pigment identification results demonstrated that our model achieved comparable results to the analysis by elemental mapping, suggesting the generalizability and stability of our model. 
    more » « less
  5. null (Ed.)
  6. This dataset contains machine learning and volunteer classifications from the Gravity Spy project. It includes glitches from observing runs O1, O2, O3a and O3b that received at least one classification from a registered volunteer in the project. It also indicates glitches that are nominally retired from the project using our default set of retirement parameters, which are described below. See more details in the Gravity Spy Methods paper. 

    When a particular subject in a citizen science project (in this case, glitches from the LIGO datastream) is deemed to be classified sufficiently it is "retired" from the project. For the Gravity Spy project, retirement depends on a combination of both volunteer and machine learning classifications, and a number of parameterizations affect how quickly glitches get retired. For this dataset, we use a default set of retirement parameters, the most important of which are: 

    1. A glitches must be classified by at least 2 registered volunteers
    2. Based on both the initial machine learning classification and volunteer classifications, the glitch has more than a 90% probability of residing in a particular class
    3. Each volunteer classification (weighted by that volunteer's confusion matrix) contains a weight equal to the initial machine learning score when determining the final probability

    The choice of these and other parameterization will affect the accuracy of the retired dataset as well as the number of glitches that are retired, and will be explored in detail in an upcoming publication (Zevin et al. in prep). 

    The dataset can be read in using e.g. Pandas: 
    ```
    import pandas as pd
    dataset = pd.read_hdf('retired_fulldata_min2_max50_ret0p9.hdf5', key='image_db')
    ```
    Each row in the dataframe contains information about a particular glitch in the Gravity Spy dataset. 

    Description of series in dataframe

    • ['1080Lines', '1400Ripples', 'Air_Compressor', 'Blip', 'Chirp', 'Extremely_Loud', 'Helix', 'Koi_Fish', 'Light_Modulation', 'Low_Frequency_Burst', 'Low_Frequency_Lines', 'No_Glitch', 'None_of_the_Above', 'Paired_Doves', 'Power_Line', 'Repeating_Blips', 'Scattered_Light', 'Scratchy', 'Tomte', 'Violin_Mode', 'Wandering_Line', 'Whistle']
      • Machine learning scores for each glitch class in the trained model, which for a particular glitch will sum to unity
    • ['ml_confidence', 'ml_label']
      • Highest machine learning confidence score across all classes for a particular glitch, and the class associated with this score
    • ['gravityspy_id', 'id']
      • Unique identified for each glitch on the Zooniverse platform ('gravityspy_id') and in the Gravity Spy project ('id'), which can be used to link a particular glitch to the full Gravity Spy dataset (which contains GPS times among many other descriptors)
    • ['retired']
      • Marks whether the glitch is retired using our default set of retirement parameters (1=retired, 0=not retired)
    • ['Nclassifications']
      • The total number of classifications performed by registered volunteers on this glitch
    • ['final_score', 'final_label']
      • The final score (weighted combination of machine learning and volunteer classifications) and the most probable type of glitch
    • ['tracks']
      • Array of classification weights that were added to each glitch category due to each volunteer's classification

     

    ```
    For machine learning classifications on all glitches in O1, O2, O3a, and O3b, please see Gravity Spy Machine Learning Classifications on Zenodo

    For the most recently uploaded training set used in Gravity Spy machine learning algorithms, please see Gravity Spy Training Set on Zenodo.

    For detailed information on the training set used for the original Gravity Spy machine learning paper, please see Machine learning for Gravity Spy: Glitch classification and dataset on Zenodo. 

     
    more » « less